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We study the problem of performing orthogonal qubit measurements simultaneously. Since these 
measurements are incompatible, one has to accept additional imprecision. An optimal joint mea- 
surement is the one with the least possible imprecision. All earlier considerations of this problem 
have concerned only joint measurability of observables, while in this work we also take into account 
conditional state transformations (i.e., instruments). We characterize the optimal joint instrument 
for two orthogonal von Neumann instruments as being the Liiders instrument of the optimal joint 
observable. 
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I. INTRODUCTION 

It is a fundamental fact of quantum theory that there 
exist pairs of incompatible measurements. The simplest 
example is a pair of (ideal) spin component measure- 
ments in different directions. These measurements can- 
not be measured jointly using a single device. 

The existence of incompatible measurements (i.e., im- 
possibility of certain joint measurements) is linked with 
some other impossible tasks, such as cloning and tele- 
portation. For each impossible device, one can study its 
best approximative substitute. This kind of optimal pos- 
sible device then gives an absolute bound for the error 
one has to face in any attempt to build the impossible 
device. Evidently, this kind of quantitative bound on the 
error can tell us much more than just a plain statement 
of impossibility. 

The question of approximate joint measurements of 
two sharp qubit observables (e.g., spin-1/2 components) 
was first studied in |l| . In recent years this topic has been 
investigated from several different aspects. The Mach- 
Zehnder interferometric setup was analyzed in [2|, |3| from 
the point of view of joint measurements. Various trade- 
off relations concerning joint approximations were de- 
rived in 1J-13|- Characterizations of all jointly measur- 
able two-outcome qubit observables were determined in 
f8l- [l0[ . A connection between the CHSH Bell inequal- 
ity |ll| and the bound on join t qubit measurements was 
observed in [IJ], and in [IJ] it was shown that every 
pair of two-outcome observables being not jointly mea- 
surable enables the violation of the CHSH Bell inequality. 
The relationship between cloning of observables and joint 
measurements was investigated in [ij] . 
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In the current work we study the question of approx- 
imate joint measurement of two sharp qubit measure- 
ments from a different perspective. In earlier works, dis- 
cussion has concerned only joint measurability of observ- 
ables. In this work we extend the problem to a joint 
measurability of instruments. In other words, we con- 
sider approximations not only to measurement outcome 
probabilities but also to conditional state transforma- 
tions. One of our main results is the characterization 
of the optimal joint instrument for two orthogonal von 
Neumann instruments. 

This paper is organized as follows. In Sec.|ll]we explain 
the two different levels of compatibility. Some useful de- 
tails on joint observables are presented in Sec. IIIII In 
Sec. IIVI a general form for joint instruments is derived. 
The optimal approximate joint instrument for two von 
Neumann instruments is then characterized in Sec. [V] 
Finally, in Sec. |Vl] we discuss the case of three von Neu- 
mann measurements. 



II. TWO LEVELS OF INCOMPATIBILITY 

Compatibility of quantum measurements has different 
meanings depending on what we take into consideration. 
In particular, two measurements can be compatible if we 
care only about the bare measurement outcome statis- 
tics, but fail to be compatible if we take into account the 
dynamics of the measurements. This fact is the motiva- 
tion for the current investigation and in the following we 
explain this twofold meaning in detail. 

In particular, let us consider two sharp observables X 
and Y on a qubit system. These can be, for instance, 
spin component measurements on a spin-^ system. The 
observables X and Y are described by the sclfadjoint op- 
erators (Tx = X • cr and Uy = y ■ cr , where cr ~ (cti, (T2, fs) 
are the Pauli matrices and x, y are unit vectors. Alter- 
natively, and for our purposes more conveniently, these 
observables can be described by projection valued mea- 
sures (PVMs). Then X and Y are identified as mappings 




x_{e) 




^+{q) 



FIG. 1: (a) An observable describes the measurement out- 
come statistics, whereas (b) an instrument describes both the 
measurement outcome statistics and the conditional output 
states with possible separation in accordance with the classi- 
cal outcomes. 



from a set of measurement outcomes to projeetors 

±1 ^ X(±l) = i(/ ± ax) , ±1 ^ Y(±l) - i(/ ± dy) . 

A measurement of X (similarly Y) gives either a result 
up (+1) or down (-1); see Fig. [T^. For instance, if the 
system is in a state p, then the probability of getting the 
outcome 1 in a measurement of X is tr [pX(l)]. The op- 
erator (Tx gives the average value of the X measurement, 
which means that the formula, 

tr[gX(l)]-tr[^X(-l)]=tr[eax], 

holds for all states g. 

In our following investigation we assume that the unit 
vectors x and y are orthogonal. This is equivalent to 
the condition that tr [X(i)Y(j)] = i for every i, j = ±1. 
Hence, certain predictability of one outcome of X implies 
that both outcomes of Y are equally likely, and vice versa. 
This relation is usually referred to as (value) complemen- 
tarity [15[. 

Our assumption on the orthogonality of x and y 
means, in particular, that the observablcs X and Y do not 
commute, i.e., X(i)Y(j) ^ Y(j)X(i). The noncommuta- 
tivity implies the impossibility of performing their joint 
measurement. Therefore, we need to choose whether we 
measure X or Y, their simultaneous measurement being 
impossible. 

It is possible to approximate X and Y with a pair of 
jointly measurable observables A and B described by pos- 
itive operator valued measures (POVMs) |16l . Il7| . An 



essential fact is that for POVMs (unlike for PVMs) com- 
mutativity is not a necessary condition for joint measur- 
ability. 

Suppose we want to approximate X and Y equally well. 
Then a class of approximating observables, parametrized 
by a number < 77 < 1, is defined by 



A(±l) = i(/±7?a 



B(±l) = i(/±77ay) 



The number 77 quantifies how close A and B are to X and 
Y, respectively. 

In the limiting case 77 = 1 we have A = X and B = Y, 
however, in such case A and B are not jointly measurable. 
It was shown in [l| that A and B have a joint measure- 
ment if and only if 77 < -j=. Therefore, we fix 7/ = -j= and 
A, B are then the optimal jointly measurable approxima- 
tions to X, Y. 

At this point one may wonder whether the 77- 
parametrized class of observables leads to the best ap- 
proximation, or perhaps some modification gives a bet- 
ter approximation (while preserving joint measurability). 
However, in 0, Q it has been proved that any modifica- 
tion to A and B leads either to a worse approximation or 
lack of joint measurability. 

A joint observable for the observables A and B is de- 
fined as a POVM G with four outcomes corresponding to 
four possible pairs of A and B outcomes, (±1,±1). It is 
required that the measurement outcome statistics for A 
(B) measured alone can be obtained from the joint ob- 
servable by disregarding (summing through all possible) 
outcomes for B (A). Hence, the defining condition for G 
is that 



A(±l) 
B(±l) 



G(±l,l) 
G(l,±l) 



G(±l,-1) 
G(-l,±l) 



(2.1a) 
(2.1b) 



In other words, A and B are marginals of G. A possible 
choice is 



G(+l,±l) 
G(-l,±l) 



^+^('^x±cry) 
^--/I^Tf^y) 



(2.2a) 
(2.2b) 



It is easy to verify that G indeed fulfills the requirements 
(|2.ip and that each G(±l, ±1) is a positive operator. Var- 
ious ways to realize G and other related measurements 
have been discussed (e.g., in [2|, Ea])- 

So far, our discussion has concerned only joint mea- 
surability of observables (i.e., compatibility of measure- 
ment outcome probabilities). There is also another level 
of compatibility, arising from the fact that a (nontriv- 
ial) quantum measurement necessarily affects the state of 
the measured system. Thus, each measurement outcome 
has an associated operation, which is mathematically de- 
scribed as a completely positive trace-nonincreasing map- 
ping on the set of states. The collection of all these op- 
erations forms an instrument Il9l. 



The standard measurement for a discrete sharp observ- 
able is the so-called von Neumann measurement. The 
corresponding instrument, which we call the von Neu- 
mann instrument, has a very simple form. In our case, 
the von Neumann instruments X and y associated with 
the sharp observables X and Y, respectively, are given by 



X±{q) = X(±l)gX(±l), 

y±{Q) - Y(±i)^Y(±i). 



(2.3a) 
(2.3b) 



For instance, if the system is in a state g and a measure- 
ment of X gives the outcome 1, then the unnormalized 
output state is X+{q)\ see Fig.[ljb). We can also write 

A'+(e)=X(l)eX(l) = tr[gX(l)]X(l), 

which shows that the normalized output state is X(l). 

Since X and Y are not jointly measurable, none of 
their instruments can be jointly measurable. In particu- 
lar, there is no measurement scheme which would realize 
both X and y. Therefore, if we want to realize the in- 
struments X and 3^ in a single measurement scheme, we 
need to approximate them. 

In the case of the approximating observables A, B the 
von Neumann instruments are commonly replaced by 
Liiders instruments C^ and C^ , defined as 



£lig) = v/B(±l)ev/B(±l) 



(2.4a) 
(2.4b) 



For the operator A(l), the square root a/A(1) takes the 
form, 



vw)=\m^m)+\i 



1-77 



X(-l) 



Hence, we see that 

^+{0) = 1(1 + r?)A'+(£i)+ other terms. 

In this way, we can understand the C^ measurement as an 
approximate version of the X measurement. We refer to 
[201 for a convenient summary of the Liiders instrument 
in general. 

In the limiting case 77 = 1 when A = X and B = Y 
the formulas (|2.3p and (|2.4|) coincide. Hence, we would 
expect that the Liiders instruments of A and B are good 
approximations to the von Neumann instruments of X 
and Y. Here, however, we face a problem. It was shown 
in [21| that two Liiders operations £^ and £^ are jointly 
measurable if and only if [25| either A(l) -I- B(l) < / 
or B(l) = 5A(1) for some < 6 < 1. Since neither of 
these two conditions holds in our situation, we find that 
the Liiders instruments C^ and C^ cannot be realized 
in a single experimental setup. Therefore, they do not 
provide the jointly measurable approximations that we 
are looking for. 

We conclude that the obvious replacements for the von 
Neumann instruments of X and Y, namely the Liiders 




B(0;7) 



^'7!^^^- 



FIG. 2: Visual representation of the four-ball intersection 
determining the existence of joint observable for A and B 
parametrized by 7 and vector g. If 7 = 1/2, all four balls 
intersect in a single point (white circle) defining the joint ob- 
servable (solid arcs). If 7 differs from 1/2 (e.g. 7 = 0.4 as in 
the figure), two opposite circles intersect, while the other two 
do not; in such a case there is no joint observable with given 
7 (dot-dashed arcs). 



instruments of A and B, are not jointly measurable, al- 
though the observables A and B are. On the other hand, 
joint mcasurability of A and B implies that they have 
some jointly measurable instruments. In fact, every in- 
strument implementing a joint observable of A and B 
gives instruments for A and B as its marginals. In the 
following we will characterize the joint instrument which 
gives the best approximations for the von Neumann in- 
struments of X and Y. 



III. JOINT OBSERVABLE 

In this section we derive some useful properties of the 
joint observable G, defined in (|2.2p . Let us first make a 
general observation. Suppose that A and B would have 
a second joint observable G'. Then, also all the convex 
combinations AG + (1 — A)G'. < A < 1, defined as 
(i,J=±l) 

[AG + (1 - A)G'](i, j) = XG{i,j) + (1 - X)G'{i,j) , 

arc joint observables of A and B. This leads to the conclu- 
sion that A and B either have a unique joint observable or 
uncountably many different joint observables. The case 
under investigation falls, luckily, into the first class. This 
is essential for our investigation as it crucially limits the 
search for optimal joint instruments. 

To see that G is a unique joint observable for A and B, 
we first notice that any joint observable G' for A and B is 



completely determined by a single operator, say G'(l, 1). 
The other operators are then recovered from the marginal 
conditions (|2.ip . Since G'(l, 1) is a positive operator, we 
can write it as 

G'(l,l) = i(7/ + g.^), 

where 7 > and g G M^. As noticed in [l[, the conditions 
for G'(l, 1) to define a joint observable for A and B are 
the following operator inequalities: 

0<G'(1,1) < A(l), 
A(l) + B(l)-/< G'(l,l) < B(l). 

These are equivalent to the requirement that the vector 
g is in the intersection of four balls: 



geB(0;7)nB(— x;l-7 

nB(i=y;l-7)nB(-L(x + y);7 

Since x and y are orthogonal unit vectors, the inter- 
section is nonempty only if 7 = i and in that case 
g = 77^ (x + y); see Fig. [21 This means that G' = G, 
proving that there is only one joint observable for A and 
B. 

It may be worth emphasizing that the joint observable 
of two observables is unique only in special cases. For in- 
stance, if instead of taking rj — -j= we would have chosen 
a smaller number in the definition of A and B. then they 
would have infinitely many joint observables. This is as 
well evident from Fig. [2 

Let us denote by fJ.±±{g) = tr [gG(±l, ±1)] the prob- 
abilities observed in the G measurement. If a state g is 
written as g = ^(/ -I- r • cr), then 



^^3kig) 



1 



1 

V2 



{JY • X + /cr • y) 



j, fc = ± . 



It is now straightforward to see that 

^J■+^{.g) + ^^-±{g) = -^ 

for all states g. This implies that the whole probability 
distribution is actually determined only by two numbers 

[e.g., fi++{g) and fi^ (g)]. We further notice that the 

numbers /i-|__|-((3) and /i_| (g) satisfy 



M++(£') 



^J'+-{Q) 



< 



This inequality characterizes the convex set of all possible 
probability distributions in the range of G. 



Due to the simple structure of G, we can characterize all 
its instruments G in an uncomplicated way. Let us, for 
a moment, concentrate on G++, an operation associated 
with the outcome combination (1,1). 

We first observe that 2G(1, 1) =: i? is a one-dimen- 
sional projection. Let {K^} be the set of Kraus operators 
for G++, so that 



G++{g) = ^KegK^, 
J^K^Ki = G(l,l) = ii?. 



The last equation implies that for each €, we have 
2KgKe < R. Since i? is a one-dimensional projection, 
there is a number < fc^ < 1 such that 2KfKi = kgR. 
Clearly, 'Y^gki = 1. Let Kg = Ui \Ki\ be the polar de- 
composition of Kf. Here C/^ is a unitary operator and 



For every state g, we then get 

K^gKl = \kiUtRgRU; ^\ki,iY[gR]U^RUt 
= iv[gQ{l,l)]kiUiRU; 

and hence 

G++{g) = tr [eG(l, 1)] Y, hUiRU*, . 

e 

Each UeRU^ is a one-dimensional projection and the con- 
vex sum. 



Y, hUiRu; = i 



++J 



is therefore a state. 

A similar calculation can be performed for the other 
three operations separately. Hence, we conclude that 
an instrument G implementing G is determined by four 
states ^±±, and the corresponding operations are given 
by 

Q±± [g) = tr [eG(±i, ±1)] e±± = ^Ji±± {g)^±± ■ (4.1) 

As we have seen, this simple structure of the instruments 
implementing G is due to the fact that each element 
G{i,j) is a rank-1 operator. 

Finally, let us emphasize that the probabilities iJ^±±{g) 
are fixed since G implements the observable G. The free- 
dom we have is only in the choice of the four states £,±± . 



IV. JOINT INSTRUMENT 

An instrument G implementing the joint observable G 
consists of four operations Gjk, 3, fc = ±, satisfying 

tT[Gjk{g)]^tT[gG{j,k)] . 



V. OPTIMAL APPROXIMATION 

A. Distance between operations 

We are seeking for the best simultaneous approxima- 
tion to the von Neumann instruments associated with X 



and Y. Wc therefore perform a measurement of G, which 
is described by an instrument G of the form (|4.ip . 

In a similar way as G gives A and B as its marginals, 
G determines marginal instruments A and B. Our aim is 
that the following approximations should be as close as 
possible: 



A+ = g++ + g+- r 


^ x^ 


B+ = ^++ + ^-+ ^ 


- y+ 


A- = g^-^Q-+ r 


- x^ 


B. = g.-+g+- r 


- y- 



As we are already using the unique joint observable 
G of the optimal approximating observables A and B, 
the measurement outcome probabilities are set and do 
not depend on the choice of g. Therefore, in order to 
quantify the distance between a given approximation and 
the corresponding von Neumann instrument, it is enough 
to compare the normalized output states. 

There are various options for how to quantify the dis- 
tance between Hilbert space operators. However, when 
considering the distance between density operators it is 
natural to choose the one induced by the trace norm. 
Operationally, it quantifies the optimal probability with 
which the states can be discriminated in a single run of 
the experiment (i.e., by observing a single experimental 
click [2II). 

The distance exhibiting the difference between the out- 
put states for a given pair of operations can be utilized to 
induce a distance between the instruments. In particular, 
in what follows we will analyze the average distance over 
all input states (Sec. IV B|) and the worst-case distance 
(Sec. IV C|) . Our interest is to minimize their values for 
all outcomes. 

If we measure X and obtain the outcome 1, then the 
output state is X(l). On the other hand, if we measure 
A and obtain the outcome 1, then the output state is 



tr[^+(g)] 



/^+-h(£') 



^t++(e)-hM+-(e) 
A*+-(£') 



e++ 



-A^ 



The trace distance of the approximation A+ from the 
desired operation X+, given that the input state is g, is 
thus 

A+{g) 



d+^iQ) 



-X(l) 



tv[A+ig)] 
where the norm on the right-hand side is the trace norm. 

B. Optimal approximations under average distance 



Assigning Bloch vectors q±± G M.^ to the states ^. 



jA,X 



the distance d_^J (g) can be written in the form, 



d^ig) 



q++ + q^ 



fig)- 



±±, 



(5.1) 



where r ~ {fxifyTfz) is the Bloch vector corresponding 
to g, f{g) = f{rx,ry) = ry/{^/2 + r^) and the norm is 
the Euclidean norm in 'M? . Similarly we get 



d^+'^ig) 



d'j'^ig) 



d'^-'ig) 



q 



q— + q-+ 



f(n\^++ 


q + 


J\Q) 


2 


fis)"^— 


q + 


J\Q) 


2 


f(.^'^- 


q+ 



Since both the vectors x, y have a vanishing z com- 
ponent, it follows that setting the z component of our 
choice of Bloch vectors q±± to zero decreases the dis- 
tances. Hence, in optimization tasks we can restrict our- 
selves to vectors q±± with the vanishing z component. 



We denote by 



the normalized integration over 
rll^ = rl +rl + r^ < 1} 



the Bloch ball B = {r e K' . i,.,, - .^ , ^y 
representing the state space of a qubit. Hence, for a 
function F defined on the state space we have 

{F{g))g^^ [ F[p{v)]drxdrydr,. 

We now define (for the outcome A+) the average distance 
to be fH 



A.X 



s 



We have chosen d^ [g) 

plify the calculations. 
We can write (15.11) as 



d'Ag) 



jA.X 



(?) 



jA,X 



instead of d , ' {g) just to sim- 



J l|q++ + q+- - 2xf 

+ J/(ef llq++-q+-ll 



-2/(e')(q+++q+- -2x)•(q+^ 



q+-)- 



As {f{g))g = Q and 



3V2, 



(/(e)'), = 2-^ln(l + V2)^ 



»A.X • 



the average distance S)_|_' is expressible in the form 



D 



A.X 



1 



|q+4 



+ "4 
Analogously, we get 

2) 

^B,Y 



r> l|2 , " II 

q+- -2x|| +^l|q+- 



,A,X 



1 2 Ct 

^l|q++ + q-+-2y|| +^llq+- 



q+-l 



q-+l 



2x||' + |||q— -q- 



2). 



4iiq- + q-H 

^ l|q- + q+- + 2yf + I llq- - q+- f 



>A,X 



The distance 15 ,' can be made zero by taking q-|-+ 



q±_ = ±x. This gives also S) 



A,X 



0. But we will then 



have S 



B,Y 



1 + a and the sum is then S) = S) 



A.X 



j,A,x ^ j,B,Y ^ j,B'Y ^ 2(1 + a). As we shaU see this 
choice does not achieve the minimal value of the sum S). 
We want to make all the four distances as small as 
possible, under the condition that they are equal. To 
find the optimal instrument, we consider the sum S = 
J)_i_' +£)_!_' + £>_■ + Sj , which is obviously a convex 
function of vectors q±±. Thus, its minimization subject 
to the conditions ||q±±|| < 1 is a convex optimization 
problem. Since S is differentiable, the optimality crite- 
rion (see e.g. (23|) for an instrument defined by a quadru- 
ple q= (q++,q+_,q_+,q__) is that 



VS)(q)^-p> VS)(q)^ -q 



(5.2) 



for all p = (p++,p+_,p_+,p — ) with ||p±±|| < 1. 
ft is now easy to verify that the optimal solution is 
achieved when q±± = (±x±y)/\/2 and this choice gives 
j,A.x ^ j,B.Y ^ ^A.x ^ ^By = 1(3 _ 2V2 + a). We can 
also compare this solution with the example where one 
direction was preferred, and we see that 



So 



pt 



2(3-2V2 + a) < 2(1 + a) 



The optimal joint instrument corresponds to the pure 
states ^^±_ = 2G(±1,±1), and we recognize it being the 
Liiders instrument C^ of G, given as 



Cl^ig) = v/G(±l,±l)gVG(±l,±l) 
= tr[G(±l,±l)g]er±. 



(5.3) 



In summary, we have found that the Liiders instrument 
C^ of G gives the optimal joint approximation of X and 
y, the quality of approximations being quantified using 
the average distance. 



C. Optimal approximations under the worst case 
distance 

The optimal joint instrument naturally depends on the 
quantification of the distance between two operations. 
We believe that the average norm studied in Sec. IV Bl is 
the most relevant way to measure the distance. 

If our task was to discriminate the given pair of instru- 
ments, the average norm would then quantify the aver- 
age success probability under supposition of choosing the 
test state p randomly. As a comparison we take a look 
also on the worst- case distance, which determines test 
states allowing the best possible discrimination of the 
two instruments. In this sense the worst-case distance 
optimizes the distinguishability over the test states. The 
worst-case distance is defined as 



2JjA,X 



supd^' (g) 



We want, again, to find an instrument which minimizes 
these distances under the condition that they are all 
equal. 

Let us observe that the Bloch vector of a normalized 
outcome state A+{g)/tT [A+{g)] is of the form. 



ljl + f{g)]ci++ + l[l-f{g)]q+- 



jk.Xi 



(5.4) 



Therefore, the distance dj^ {g) is the length of a vector 

being a convex combination of vectors q++ — x and q^ — 

X. We thus conclude that 



W_ 



A.X 



max{||q++ - x|l , |lq+_ -x|l} 



(5.5) 



Similarly, we get 



2IJ_|_' = max{j|q. 



->A,X 



-y|| J|q-+ -y||}, 

tYVT-rt •,^ fil I MM I NT 

2n_ = max{||q__ + x|| , ||q ^+x||}, 

21T!-^ = max{||q__+y||,||q+_+y||}. 

The distance 2If_|_' in (|5.5p can be made zero, and this 

happens if and only if q4--|_ = q_| = x. If q_|__|_ and 

q.) are chosen in this way we are still free to choose 

q |. and q , hence wc can also achieve 22J_' = by 

taking q = q |_ = — x. However, we will then have 



2IT' 



B.Y 



= V2 and 211 = 2n^'^-|-2IJ+'"^+2n'^'^+2rr^''^ = 2^/2. 



The value 2U = 2-\/2 for the sum is achieved also for the 
symmetric choice q±± = (±x±y)/2. In this case all the 



,A,X 



W 



B,Y 



W 



A,X 



W 



B.Y 



distances are equal, 2U^ 
1/^2. 

Let us then minimize the distances under the con- 
dition that they are all equal. First, we require that 



2jjA,X 



W 



B,Y 



and we minimize these two distances. 



ignoring for a moment the other two distances. It is 
easy to see that in the optimal case it is necessary to 
choose q++ = (x + y)/2. Similarly, if we require that 
W^ = 2ir_' and we minimize these two distances in- 
dependently of the previous minimization, wc see that it 
is necessary to put q = (— x — y)/2. These two op- 
timal choices are possible simultaneously only if we set 

q^ = (x — y)/2 and q y = (— x + y)/2. In conclusion, 

the symmetric choice q±± = (±x ± y)/2 is optimal. 

The instrument Q corresponding to this symmetric 
choice of ^±± can be written as 



G±±ig) = ^Cl±ig) 



tr[eG(±l,±l)] 



1- 
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tT[gGi±l,±l)]a± 
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tr[gG(±l,±l)]i/ 






1 

V2' 



(l-^)l^ 



(5.6) 



Hence, t/ is a mixture of the Liiders instrument C and 
another instrument, which has very simple form. In par- 
ticular, Q can be realized by using the Liiders instrument 
C , accepting the measurement outcomes but ignoring 
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eight outcomes. An observable defined as 



FIG. 3: Bloch vector representation of instrument Q gives 
us in the case of approximation 3+ the resulting normalized 
output states lying on a (thick horizontal) blue line. The av- 
erage distance is minimized for the choice of symmetrical pure 
states C±±. In the case of approximation A+ the normalized 
output states are represented by (thick vertical) red line with 
red dots representing the worst cases. These states are closest 
to X when the output state is noisier than in the average case. 



the output state in 1 — -j= parts of the measurement and 

preparing the maximally mixed state ^/ in these cases. 
Previous results can be visualized as given in Fig. |31 
In the case of the average distance [horizontal blue line 
represents possible normalized output states as a mani- 
festation of (|5.4p ] we find it is natural to expect as large 
Bloch vectors q±± as possible, but for the worst case dis- 
tance the additional noise in the normalized output state 
makes it closer to the desired state (vertical red line with 
red dots representing worst cases). 



VI. APPROXIMATE JOINT MEASUREMENT 
OF THREE VON NEUMANN MEASUREMENTS 

Let z be a unit vector which is orthogonal to both x 
and y, and let Z be the corresponding sharp observable. 
Suppose we want to approximate the von Neumann in- 
struments of X, Y, and Z. This problem of additional 
measurement bears some differences with the previously 
studied approximation task of two von Neumann mea- 
surements. 

The optimal jointly measurable approximations of X, 
Y, and Z are given by 



A(±l) 
B(±l) 
C(±l) 



^(^±Wx), 
-(/±77cry), 



with V = ^- The observables A and B are of similar 
form as before, but we have to decrease t] from -j= to 
-j= to make it possible to include the additional spin 
component direction. It has been proved in [2J] that for 
r] > -1= the three observables are not jointly measurable. 
Generally, a joint observable E for A, B, and C has 



E(±l,±l,±l) 



1 



I + ~7^i^^^ ± CTy ± CTz) 



is a joint observable for A, B, and C since it satisfies the 
marginal conditions, 

A(±l) = J2 E(±l,j,fc), 
j,fee{-i,i} 

B(±l) = J2 E(j,±l,fc), 
j,ke{-i,i} 

E E(j,fc,±l). 
i,fce{-i,i} 



C(±l) 



Unlike in the earlier situation, now we have several dif- 
ferent joint observables. Another joint observable F for 
A, B, and C is given by 

F(l,l,l) = 2E(1,1,1), 

F(l,-1,-1) = 2E(1,-1,-1), 

F(-l,-l,l) = 2E(-1,-1,1), 

F(-l,l,-l) = 2E(-1,1,-1), 

F(l,l,-1) = F(1,-1,1) = F(-1,1,1) 

= F(-1,-1,-1) = 0. 

A notable feature of the observable F is that it is es- 
sentially a four-outcome observable. Although A, B, and 
C have infinitely many different joint observables, F is 
the unique joint observable having only four nonzero el- 
ements. To demonstrate this fact, suppose that F' is a 
joint observable of A, B, and C, and that 

F'(l,l,-1) = F'(1,-1,1) = F'(-1,1,1) 
= F'(-1,-1,-1) = 0. 

First of all, let us notice that F' is completely determined 
by a single nonzero element, say F' (1,1,1). The other 
operators are given by the marginal conditions. For in- 
stance, F'(l, -1 - 1) = A(l) - F'(l, 1, 1). Since F' is an 
observable, the operators F'(i, j, k) must sum up to iden- 
tity. Hence, we get 

/ = F'(l,l,l) + F'(l,-l,-l) + F'(-l,l,-l) 
+F'(-1,-1,1) 
= F'(l, 1, 1) + [A(l) - F'(l, 1, 1)] + [B(l) - F'(l, 1, 1)] 
+ [C(1)-F'(1,1,1)]. 

Therefore, the operator F'( 1,1,1) is determined by the 
equation, 

F'(l,l,l) = i[A(l) + B(l) + C(l)-/]. 

It follows that F' = F, and the four-outcome joint observ- 
able is hence unique. 

As done previously for the joint observable G, we can 
now study the instruments implementing E and F. The 



elements forming E and F are rank-1 operators. There- 
fore, our characterization for instruments implementing 
G in Sec. II VI applies to these two observables as well. 

An instrument implementing E is determined by eight 
states C±,±.± a-iid the corresponding operations are 

£±±±ie) = tr bE(±l, ±1, ±1)] c±±± 

The approximations to von Neumann instruments X, y, 
and Z are now defined as follows: 



A± — £±++ + £± — h + £± — + £±+- 
B± = £+±+ + £-±+ + £+±- + £-±- 
C± = £++± + £ — ± + £+-± + £-+± 



X±, 



The optimal instrument under the average distance can 
be deduced by following a similar procedure as the one 
presented in Sec. IV Bl We thus determine the minimum 



of the sum S) — J),,.' - 
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where now, for instance, the distance S)_)_' is 
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and 
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i[7-3\/31n(2 + \/3)] 



Using the criterion (|5.2p . it is straightforward to ver- 
ify that the optimal solution is achieved when q±±± = 
-7= (±x ± y ± z). All the distances are then equal and 

take the value |/? + (1 7?)^- The related joint instru- 
ment is the Liiders instrument £^ of E. 

An instrument J- implementing F is, again, determined 
by eight states. However, the states corresponding to the 
zero elements play no role, so T is actually determined by 
four states only. Since F gives different measurement out- 
come probabilities than E, the related average distances 
are also different. So, if we consider F, we get 
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35 J achieves its minimum when the four relevant Bloch 
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FIG. 4: Bloch vector comparison of the four- and eight- 
outcome instruments. The four-outcome instrument leads to 
normalized output states from the edges of the tetrahedron 
(especially for the measurement along the x axis illustrated by 
the thick red diagonal). The eight-outcome instrument leads 
to the faces of the cube given by the vectors q±±± (especially 
for the measurement along the x axis illustrated by the blue 
circle). 



This choice gives 1)')^ = f 7 + (l ^ 75) ■ The related 

joint instrument is the Liiders instrument C^ of F. 

We can, again, use Bloch representation and write 
the normalized output states similarly as in (|5.4p . For 
the four-outcome instrument we get a similar result — 
the normalized output states are convex combinations of 
the corresponding states defining the instrument. For 
instance, the normalized output state, when measuring 
along axis x and obtaining the outcome -1-1, is a convex 
combination of q+++ and q-| Hence, these Bloch vec- 
tors of output states lie on the edges of a tetrahedron as 
depicted on Fig. 0] (thick red line). 

For the eight-outcome instrument we find that the cor- 
responding output states form a set 



V3 



(x + fyy + f^z) 



fl^fl<\. 



This set is a circle lying in the plane given by the vectors 
q+jfe, j, fc G {—1, 1} [i.e., being inscribed into the face of 
the cube such as is depicted in Fig. S] (blue circle)]. From 
this geometrical representation we can confirm the fact 
shown earlier; under the usage of the average distance the 
four-outcome instrument is worse that the eight-outcome 
instrument as it has a contribution from states being fur- 
ther from the reference state. 



VII. CONCLUSIONS 

The impossibility of joint measurements of orthogonal 
qubit measurements leads us to the study of their ap- 



proximations. We have considered not only observables, 
but we have accessed the problem from the perspective of 
instruments. We have characterized the optimal approx- 
imation of two von Neumann instruments, and the opti- 
mal joint instrument was found to be the Liiders instru- 
ment of the optimal (unique) joint observable. This result 
[see Eq. ()5.3|) ] was achieved by searching for the best ap- 
proximation under the average distance between the nor- 
malized output states. When considering the worst case 
norm, the resulting optimal instrument is a mixture of 
the corresponding Liiders instrument and the state-space 
contraction into the complete mixture [see Eq. (|5.6p ]. 

A similar investigation was performed in the case of 
three von Neumann instruments, but then we faced the 
problem of non-uniqueness of the optimal joint observ- 
able. Nevertheless, in the two commonly used instances 
the Liiders instrument was found to be optimal. Al- 
though the numerical value of the minimal average dis- 
tance does not have any intrinsic meaning, one can use 
it in order to compare the minimal distances in the three 
studied cases: approximation of two von Neumann in- 
struments and approximations of three von Neumann in- 
struments with eight- and four-outcome measurements. 
The minimal average distances for these approximations 
arc 0.15, 0.23, and 0.28, respectively. As one would ex- 
pect, the average distance can be made lowest in the first 
case. Namely, it is certainly easier to approximate two 
von Neumann instruments rather than three. Further- 
more, it is not surprising that in the latter two cases 



eight instead of four outcomes are more efBcient in the 
approximation task. This may be explained by a broader 
set of outcomes to choose from, leaving less space for er- 
ror. 

We believe that Liiders instruments are optimal joint 
instruments for a more general class of situations than 
only those studied here. A natural extension of the two 
orthogonal qubit observables would be the case of two 
sharp observables related to mutually unbiased bases. 
Another interesting class is that of the continuous vari- 
able systems, where phase space observables play the role 
of joint observables. These generalizations merit further 
study. 
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